-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MINOR][INFRA] Ensure that docs build successfully with SKIP_API=1 #44627
Conversation
cc @HyukjinKwon |
@@ -761,6 +761,9 @@ jobs: | |||
run: ./dev/lint-r | |||
- name: Run documentation build | |||
run: | | |||
# Build docs first with SKIP_API to ensure they are buildable without requiring any | |||
# language docs to be built beforehand. | |||
cd docs; SKIP_API=1 bundle exec jekyll build; cd .. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM but just to confirm this doesn't take a lot of time right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Correct. This adds around 5 seconds to the "Run documentation build" step.
@@ -761,6 +761,9 @@ jobs: | |||
run: ./dev/lint-r | |||
- name: Run documentation build | |||
run: | | |||
# Build docs first with SKIP_API to ensure they are buildable without requiring any | |||
# language docs to be built beforehand. | |||
cd docs; SKIP_API=1 bundle exec jekyll build; cd .. | |||
if [ -f "./dev/is-changed.py" ]; then | |||
# Skip PySpark and SparkR docs while keeping Scala/Java/SQL docs | |||
pyspark_modules=`cd dev && python3.9 -c "import sparktestsupport.modules as m; print(','.join(m.name for m in m.all_modules if m.name.startswith('pyspark')))"` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
or wonder if we should set SKIP_SCALADOC=1
together with if [
./dev/is-changed.py -m ...` condition. That might be better.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmm, could you elaborate a little bit on this? How would this prevent yesterday's scenario?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actually let's just merge this. 5 secs won't affect the elapsed time much anyway.
### What changes were proposed in this pull request? Add [custom Jekyll tags][custom] to enable us to conditionally include files in our documentation build in a more user-friendly manner. [This example][example] demonstrates how a custom tag can build on one of Jekyll's built-in tags. [custom]: https://github.com/Shopify/liquid/wiki/Liquid-for-Programmers#create-your-own-tags [example]: Shopify/liquid#370 (comment) Without this change, files have to be included as follows: ```liquid {% for static_file in site.static_files %} {% if static_file.name == 'generated-agg-funcs-table.html' %} {% include_relative generated-agg-funcs-table.html %} {% break %} {% endif %} {% endfor %} ``` With this change, they can be included more intuitively in one of two ways: ```liquid {% include_relative_if_exists generated-agg-funcs-table.html %} {% include_api_gen generated-agg-funcs-table.html %} ``` `include_relative_if_exists` includes a file if it exists and substitutes an HTML comment if not. Use this tag when it's always OK for an include not to exist. `include_api_gen` includes a file if it exists. If it doesn't, it tolerates the missing file only if one of the `SKIP_` flags is set. Otherwise it raises an error. Use this tag for includes that are generated for the language APIs. These files are required to generate complete documentation, but we tolerate their absence during development---i.e. when a skip flag is set. `include_api_gen` will place a visible text placeholder in the document and post a warning to the console to indicate that missing API files are being tolerated. ```sh $ SKIP_API=1 bundle exec jekyll build Configuration file: /Users/nchammas/dev/nchammas/spark/docs/_config.yml Source: /Users/nchammas/dev/nchammas/spark/docs Destination: /Users/nchammas/dev/nchammas/spark/docs/_site Incremental build: disabled. Enable with --incremental Generating... Warning: Tolerating missing API files because the following skip flags are set: SKIP_API done in 1.703 seconds. Auto-regeneration: disabled. Use --watch to enable. ``` This PR supersedes #44393. ### Why are the changes needed? Jekyll does not have a succinct way to [check if a file exists][check], so the required directives to implement such functionality are very cumbersome. We need the ability to do this so that we can [build the docs successfully with `SKIP_API=1`][build], since many includes reference files that are only generated when `SKIP_API` is _not_ set. [check]: jekyll/jekyll#7528 [build]: #44627 ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Manually building and reviewing the docs, both with and without `SKIP_API=1`. ### Was this patch authored or co-authored using generative AI tooling? No. Closes #44630 from nchammas/SPARK-46437-conditional-jekyll-include. Authored-by: Nicholas Chammas <[email protected]> Signed-off-by: Hyukjin Kwon <[email protected]>
Merged to master. |
What changes were proposed in this pull request?
This PR tweaks the docs build so that the general docs are first built with
SKIP_API=1
to ensure that the docs build works without any language being built beforehand.Why are the changes needed?
Committers expect docs to build with
SKIP_API=1
on a fresh checkout. Yet, our CI build does not ensure this. This PR corrects this gap.Does this PR introduce any user-facing change?
No.
How was this patch tested?
Via test commits against this PR.
The build now fails if the docs reference an include that has not been generated yet.
Was this patch authored or co-authored using generative AI tooling?
No.